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^> Abstract. Single-molecule pulling experiments on unstructured proteins linked to neurodegen- 

^ erative diseases have measured rupture forces comparable to those for stable folded proteins. 

^^ To investigate the structural mechanisms of this unexpected force resistance, we perform pul- 

ling simulations of the amyloid (3-peptide (A(3) and a-synuclein (aS), starting from simulated 
r"~i conformational ensembles for the free monomers. For both proteins, the simulations yield a set 

^ of rupture events that agree well with the experimental data. By analyzing the conformations 

PQ right before rupture in each event, we find that the mechanically resistant structures share a 

O common architecture, with similarities to the folds adopted by Ap and aS in amyloid fibrils. 

,^ The disease-linked Arctic mutation of Ap is found to increase the occurrence of highly force- 

qh resistant structures. Our study suggests that the high rupture forces observed in Ap and aS 

pulling experiments are caused by structures that might have a key role in amyloid formation. 

(N 

*^ Keywords: intrinsically disordered proteins, mechanical unfolding, Monte Carlo simulation, 

^\ Alzheimer's disease, Parkinson's disease. 

.^' Introduction 

^T) The amyloid-forming proteins linked to neurodegenerative disorders often belong to 

^~~^ the class of intrinsically disordered proteins [1-4]. Recent single-molecule atomic force 

t> microscopy (AFM) experiments show that, despite their lack of a w^ell-defined folded 

S^ Structure, such proteins sometimes exhibit high mechanical resistance w^hen pulled by 

H an applied force |]5HZl • Rupture forces of similar magnitude are, in fact, only rarely 

observed for folded proteins [jSl [9l . In this paper, w^e attempt to convince the reader 

that subensembles of folded conformations with a specific architecture can explain this 

unexpected force resistance. 

The pathway from a more or less disordered monomer to an ordered amyloid fib- 
ril, with its characteristic cross-P structure, involves a host of intermediate assemblies. 
The exact character of these species, and their possible roles in pathogenesis, is not 
completely understood [110141211 . It is, however, clear that at some point the assem- 
blies become rich in [3-sheets. Structural motifs, and in particular [3-sheets, are key to 
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the mechanical stabiHty of proteins. In general, the response to a stretching force is 
largely dictated by native topology 11131 I14II and pulling geometry 11151 IT611 . and high 
stability can be traced to a specific structural unit, a mechanical clamp [JH]]. Typically, 
these units are |3-sheets, arranged such that the strands cannot be separated without 
breaking a number of hydrogen bonds all at once. The AFM-detected force-resistant 
states of disease-related proteins []5]-[7l] therefore appear likely to contain [3-sheets, and 
could play a vital role in the aggregation process. Knowledge of the structural specifics 
of the force-resistant states could thus provide valuable and general insight into the 
aggregation mechanisms in amyloid diseases. 

In this study, we use atomic-level Monte Carlo (MC) simulations to search for and 
characterize possible mechanical clamps in Alzheimer's amyloid [5-peptide (A[5) and 
the Parkinson's disease-linked a-synuclein protein (aS). Both proteins are unstructured 
and amyloid-forming, and displayed mechanical resistance in AFM experiments [TsHTH . 
Computational modeling offers unique opportunities to interpret this force response, 
but requires the use of large sets of trajectories, due to the structural diversity of the 
proteins. Our calculations are carried out starting from large and diverse sets of initial 
conformations, randomly dravwi from previously simulated ensembles ||17[ I18II . In the 
simulations, high-force rupture events occur for both proteins. We find that these events 
are caused by one specific type of mechanical clamp, which shows similarities to the 
folds adopted by Ap [191 HQB and aS | |21 || in fibrils. Such structures could play an 
important role in the addition of monomers to growing A(3 and aS fibrils, an association 
that requires large parts of the monomers to adopt a specific fold. In addition to the 
wild-type (WT) sequences, we study the disease-related Arctic E22G mutant | |22 || of Ap, 
and find it to increase the occurrence of strong mechanical clamps. 

Methods 

Model. Our simulations are based on an all-atom implicit solvent model with torsional 
degrees of freedom [|23l1 . The energy function consists of four main terms 

The £ev term represents excluded volume effects, whereas Eiq^. handles local interactions 
among neighboring atoms. Hydrogen bonds between backbone NH and CO groups, 
and between charged side-chain groups and the backbone are handled by E^b- The 
last term, E^^, represents interactions between side chains, both hydrophobic attraction 
and attraction/repulsion between charged side chains. The mathematical form and 
parameters of the model are given elsewhere [j23|1 . The computational convenience of 
the model allows us to generate large sets of trajectories, which is important because of 
the conformational polymorphism of the proteins studied. 

One of the big challenges in creating protein models is to obtain a realistic tempera- 
ture dependence. The temperature scale of our model was determined to give correct 
folding temperatures for a diverse set of small peptides [23]. Our simulations have. 
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however, shown that the temperature scale needs recaHbration for the larger Ap and 
aS proteins ||17[ I18II . Below we will both give the nominal temperatures used in the 
simulations and indicate what physical temperatures these correspond to, based on 
comparisons with experimental data. 

Pulling. We model the action of the pulling force by adding a harmonic potential in the 
end-to-end distance of the protein, L(x) (between the first and last backbone atoms), 
where x denotes a protein conformation. The equilibrium length is initially set to the 
end-to-end distance of the starting conformation, Lg = L(xq), and then increases with 
a constant velocity v. The full energy function is given by 

EU^, t) = £(x) + - [Lo + vt - L(x)] ' , (1) 

where £(x) is the energy in the absence of external force, t is MC time and k a spring 
constant. In our calculations, we set k = 37pN/nm and v = 0.05fm/MC step. The 
AFM study of Ap and aS by Hervas et al. [Tl was performed using an estimated can- 
tilever stiffness of 30-70 pN/nm and a pulling speed of 0.4nm/ms. Our v cannot be 
directly translated to an experimental pulling speed, but was empirically chosen based 
on previous simulations of fibronectin domains [24]. Here, we found simulation re- 
sults obtained using v = O.lfm/MC step to be compatible with AFM measurements 
at a pulling velocity of 0.6 nm/ms. Therefore, in our present calculations, we expect 
rupture forces comparable to those measured by Hervas et al. [7], especially since the 
velocity dependence is only logarithmic II25II . Generating Ap and aS unfolding trajec- 
tories at experimental pulling velocities by conventional molecular dynamics methods 
is a challenge, as the required time span of each trajectory is ~ 0.1 s. 

MC details. We simulate the above model using MC dynamics. MC is a crude approxi- 
mation to the real dynamics over short time scales, but we expect MC-based unfolding 
simulations to capture the major free-energy minima and barriers encountered by a 
protein when stretched, provided that only small trial moves are used [26l I27II . Our 
calculations are based on two "small-step" MC updates: rotations of single side-chain 
angles and coordinated Biased Gaussian Steps (BGS) for backbone angles II28II . BGS 
updates up to eight consecutive backbone torsion angles in a manner that keeps the 
ends of the updated segment approximately fixed. The equilibrium length of the har- 
monic spring pulling the protein is increased by a small constant amount in each MC 
step. 

A|5 ensembles. We study the highly amyloidogenic 42-residue form of Ap. In previ- 
ous work, we investigated monomer properties and dimer formation for four variants 
of this peptide: WT, E22G, F20E and E22G/I31E ^7\ |29l. The model temperature 
(nominally 37 °C) was chosen for best match with experimental chemical shifts II30II 
and J-couplings II31II measured at 0-5 °C. The present study is conducted at the same 
temperature, and focuses on the WT and E22G variants. To ensure proper sampling, 
new extended simulations of the free monomers were performed, which indeed agree 
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very well with previous data. Starting from conformations randomly drawn from these 
equilibrium ensembles, a set of 512 independent pulling trajectories was generated for 
each of WT A^ and E22G Ap. 

aS ensemble. In our previous study of the free 140-residue aS monomer, we observed 
two distinct phases: a disordered high-energy phase, D, and a [3-structure-rich low- 
energy phase, B II18II . The transition frequency between these phases was insufficient 
for a determination of their relative fraction as a function of temperature. Our analysis 
therefore focused on single-phase properties, evaluated using a fixed model tempera- 
ture (nominally 56 °C) at which both phases coexisted. The generated D phase ensem- 
ble was found to be in good agreement with NMR data at 15 °C II32II . A consistent 
description of NMR data at -15°C and -10 °C |]33l[34l in terms of the B and D phases 
was, by contrast, found to require a significant B phase fraction, around 50-70%. Our 
pulling simulations are carried out using the same model temperature (56 °C), with ini- 
tial conformations randomly drawn from the two single-phase equilibrium ensembles 
(total number 1024). As in the previous study, the two phases are analyzed separately. 

Analysis. In the pulling simulations, we monitor the external force, F, and the end-to- 
end distance, L. Some examples of force-distance trajectories can be found in Fig. [l| 
In our analysis of the generated trajectories, the data are noise filtered using a sliding 
window of size r„ = 6 x 10^ MC steps. We identify rupture events as sudden drops 
in force with MC time. To discriminate true rupture events from noise, we require the 
force to drop by > 20 pN within a time interval of r„, from a peak value > 20 pN (the 
force can drop to negative values). For each rupture event, we register the maximum 
force, Ff, and the end-to-end distance, L^, at which the peak occurs. 

Secondary structure is classified using the program stride II35II . The ^-structure 
content is calculated as the fraction of residues that are in the class extended. Solvent 
accessibility calculations are performed using the program naccess fl36|] . 

To locate (3-hairpin turns, a contact-based measure of turn propensity, T;, is cal- 
culated. Two residues are said to be in contact if two or more heavy-atom pairs 
are within 4.5 A of each other. The measure T; is defined as the sum of all contact 
probabilities pjj^ in a strip perpendicular to the main diagonal of the contact map, 
2(i-l)<; + fc<2(t + l). 

Figures of three-dimensional structures are drawn using pymol ['37IJ. 

Results 

WT A(5. The Ap molecule is small and natively unfolded, and may seem unlikely to 
exhibit any mechanical resistance. However, in their AFM study, Hervas et al. |T7!l found 
that many unfolding traces contained at least one force peak > 20 pN. The maximum 
recorded rupture force exceeded 300 pN. 

We examine the structural mechanisms causing this force resistance using MC simu- 
lations. As described in Methods, starting from a simulated ensemble of conformations, 
we study the response of the protein when pulled at constant velocity. In some of the 
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Figure 1: Examples of simulated unfolding trajectories. (Upper panel) Force, F, versus end-to- 
end distance, L, in two runs for WT Ap. In one trajectory, no force-resistant state 
is encountered as the molecule is stretched. In the other, a rupture event occurs at 
Lf = 7.5 nm and F^ = 119 pN. The three-dimensional structure is a snapshot just 
before the rupture event. The black and white curves illustrate the noise-filtering used 
when analyzing the data (see Methods). (Lower panel) Similar plot showing two aS 
trajectories with zero and three rupture events, respectively. The snapshots illustrate 
the core structures involved in the rupture events. The first snapshot shows residues 
1-101, whereas the final three-stranded P-sheet is formed by the 33-85 segment. 



simulated trajectories, when stretched, the Ap molecule gets trapped in a force-resistant 
state, the breaking of wrhich gives rise to a force maximum (Fig. [l]). Rupture events, 
signaled by force peaks >20pN, occur in 148 of our 512 unfolding trajectories (see 
Methods for a description of our peak detection protocol). Most of these 148 runs con- 
tain one rupture event, but there are also trajectories w^ith tw^o such events (Table [l]). 
For each rupture event, we record the maximum force, F^, and the end-to-end distance, 
L^, at which the peak occurs. The distributions of F^ and L^ are broad (Fig. [2]), show- 
ing that the observed rupture events are caused by a range of structures with varying 
mechanical strength. Our simulations differ in several details from experiments, where, 
for instance, the molecule of interest is embedded in a multimodular construct. Despite 
these uncertainties, our results are in good overall agreement with those reported by 
Hervas et al. O . The conformational polymorphism that we observe resembles that 
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Protein 



N, 



traj 



N, 



mech 



iV„ 



APWT 


512 


148 (135, 13, 0, 0, 0) 


161 


AP E22G 


512 


172 (144, 27, 1, 0, 0) 


201 


aS, D phase 


449 


2 (1, 1, 0, 0, 0) 


3 


aS, B phase 


575 


563 (37, 205, 274, 43, 4) 


1461 



Table 1: Rupture event statistics. The total number of trajectories (iV^aj), the number of trajecto- 
ries with at least one rupture event (iVjuech)> ^nd the total number of observed rupture 
events (iVgy) for the four systems studied. The numbers of trajectories with 1, 2, 3, 4 
and 5 rupture events are indicated within parentheses in the iVj^ech column. 
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Figure 2: Histograms of rupture force {upper panel), F^, and rupture distance (lower panel) , L^, 
for WT and E22G Ap, based on 161 and 201 rupture events, respectively. 



found in the experiments, and our measured rupture forces are comparable to the ex- 
perimental ones. 

Having seen this agreement, w^e next examine the structures causing the simulated 
rupture events. What conformational features give rise to rupture forces as high as 
210 pN (Fig. |2])? The starting conformations for our pulling simulations are drawn 
from a diverse equilibrium ensemble, generated as described in Methods. Fig. [s] shows 
the free energy of this ensemble, calculated as a function of accessible surface area 
and P-structure content. Two major free-energy minima can be identified, both shal- 
low and broad. One minimum corresponds to single extended (3-sheets and the other 
to more compact, often double-layered structures, with lower accessible surface area 
(<36nm^) and lower |3-structure content (<0.5). The plot symbols in Fig. [s] repre- 
sent the conformations from which our pulling simulations are started, and indicate the 
magnitude of the maximum rupture force along the respective trajectories. A majority 
of the trajectories containing rupture events start out from the free-energy minimum 
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Figure 3: Equilibrium free energy, F[ASA,P), calculated as a function of accessible surface area, 
ASA, and ^-structure content, (3, for WT Ap. Plot symbols represent starting structures 
for the pulling simulations, using averages of ASA and p over a short time interval (6 x 
10^ MC steps). The plot symbol indicates whether the maximum rupture force along 
the corresponding pulling trajectory is <20pN (triangle), 20-150 pN (diamond), or 
> ISOpN (circle). The free energy has two shallow minima. The three-dimensional 
structures represent these minima. 



corresponding to single extended P-sheets. Conformations from the other minimum 
have shorter P-strands and are typically unable to withstand high forces. 

To pinpoint the structural elements providing mechanical resistance, w^e examine 
the conformations sampled just before the rupture events. An analysis of secondary- 
structure content before rupture shows that high rupture forces occur only if the P- 
structure content is high (Fig. |4]). For a more detailed picture, we divide the rupture 
events into four classes based on P-sheet geometry, as indicated by the plot symbols in 
Fig. |4) The first two classes correspond to three-stranded P-sheets with specific turn 
locations (turns are identified using the measure T; defined in Methods). The turns are 
located in the 13-15 and 23-24 intervals for the first class, and in the 13-15 and 25- 
27 intervals for the second class. The third class represents all three-stranded P-sheets 
with other turn locations. All these three classes contain many high-f^ events, and 
the average F^ is > 100 pN in all three cases. The fourth and last class collects all the 
remaining rupture events. The corresponding structures are often double layered, and 
sometimes contain smaller P-sheets. For this class, the average rupture force is ~ 40 pN. 
This analysis confirms that most of the main mechanical clamps are three-stranded P- 
sheets. Typically, they have a simple meander topology. P-sheets with an odd number 
of strands in a meander pattern are indeed obvious candidates for mechanical clamps. 
Unlike a P-hairpin, they cannot be unzipped one hydrogen bond at a time. 

The three-stranded P-sheets causing rupture events share similar sequence locations, 
as one might expect for a small protein such as Ap. To quantify this, we show in 
Fig. [5] a ^-structure profile, obtained by averaging over structures just before rupture 
in all events involving three-stranded P-sheets. The profile shows that there indeed are 
statistically preferred turn and strand regions. In addition, it can be seen that the second 
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Figure 4: Scatter plots of rupture force, F^, and P-structure content, p, for WT {upper panel) and 
E22G {lower panel) Ap. p is an average over a short time interval (6 x 10^ MC steps) 
just before the rupture event. The plot symbols indicate four conformational classes. 
The first two correspond to three-stranded P-sheets with specific turn locations. The 
third class represents extended three-stranded P-sheets with other turn locations. The 
"other" category consists of structures with at most two extended P-strands, some of 
which are double-layered. 



and third strand regions overlap w^ith the tv\ro strand regions in A^ fibrils ||19[ I20II . 

For a P-sheet to display mechanical resistance, its strands must be sufficiently long. 
In the case of A|3, the required strand length can be achieved only if a large part of the 
whole molecule participates in the |3-sheet. In fact, among the observed force peaks 
> 100 pN for the tw^o A|3 variants, almost all are caused by structures with a ^-structure 
content >0.5 (Fig. Q. By forming such structures, it becomes possible for Ap, despite 
its small size, to withstand these high forces. 

It is worth noting that these P-sheets still do not have maximal size. In WT Ap 
simulations at a lower temperature, we observed even larger three-stranded P-sheets 
(P-structure contents of around 0.8). In pulling simulations started from these confor- 
mations, we recorded rupture forces of up to 300 pN. However, these conformations are 
not significantly sampled at the temperature used in the present study. 
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Figure 5: Average P-structure profiles for WT Ap {upper panel) and aS Qower panel) rupture 
events involving three-stranded P-sheets. For each event, residue-specific P-structure 
propensities are computed over a short time interval (6 x 10^ MC steps) just before 
rupture. A second average over events gives the final profile. Horizontal bars in the 
upper panel indicate strand regions in Ap fibrils as obtained by Petkova et al. [ 19 ] and 
Liihrs et al. [20j. In the lower panel, aS fibril strand regions [|21|| are shaded gray. 



Arctic A|5. Having identified candidate structures for mechanical resistance, it is still 
not clear what relevance these have for aggregation and disease. A natural next step 
is to study how^ the force response is altered by disease-related mutations, such as the 
(Arctic) E22G mutation. The experiments by Hervas et al. QTI] found this mutation to 
lead to an increased occurrence of force-resistant states. The highest force peak w^as 
> 400 pN, indicating the breaking of a "hyper-mechanostable" conformation. 

We study E22G A[3 using the same setup as for WT A|3, and see many similarities 
between the two variants, as is expected from a single point mutation, but also some 
important differences. Compared to WT, we observe 25 % more rupture events for E22G 
and 15% more mechanically resistant initial conformations (Table [l]). In particular, 
there is an increase by 60 % in the number of events with F^ > 150 pN for E22G (40 vs 
25). Our highest f J- is ~ 260 pN for this variant, compared to ~210pNfor WT (Fig. [2]). 

The increased number of high-Fj. events for the E22G mutant can, at least partly, be 
traced to a conformational difference noted in our previous work 11171 12911 . Here, we 
compared the propensities of the four variants WT, E22G, F20E and E22G/I31E to form 
turns centered in the 25-30 interval, which is the loop region of the [5-loop-(3 motif in A[3 
fibrils [191 l20l I38II . The probability for such turns was found to correlate with the rate 
of fibril formation, and was, in particular, higher for E22G than for WT QlTl HU . If the 
A[3 molecule is to form a large and potentially force-resistant three-stranded ^ -sheet, 
the 25-30 region is a favorable location for the second turn. 
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Figure 6: Aligned WT Ap structures before pulling (left) and at rupture (right) for the 28 ob- 
served rupture events in the class with turns at 13-15 and 25-27 (squares in Fig.|4]). 
For the initial structures residues 10-35 were aligned, for the other case residues 5-35. 
Residues 13-15 are in red and residues 25-27 in blue in both alignments. 



The high-ff events for E22G can to a large extent be linked to three-stranded P- 
sheets with ^-structure contents of ~0.7 (Fig. [4]). Many of the conformations are in 
the class with the turns located in the 13-15 and 25-27 regions. In line with the 
above-mentioned analysis |)171 I29II , we find that the number of rupture events in this 
conformational class increases from 28 for WT to 53 for E22G. Fig. [6] shows aligned 
initial conformations and conformations right before rupture for this class, illustrating 
how the simulations start from a diverse set of conformations which "condense" to one 
single structure when pulled. 

aS. AFM experiments by different groups have shown that the Parkinson's disease- 
related aS protein can display mechanical resistance [l5] - [7[l . The set of around 100 
pulling trajectories generated and analyzed by Hervas et al. p7] included two in the 
hyper-mechanostable category (F^ > 400 pN). 

Our aS study is based on a set of 1024 constant-velocity pulling simulations, two 
of which are shown in Fig. [ij As in the A^ study, initial conformations are randomly 
drawn from a simulated ensemble for the free monomer II18II . The aS ensemble can be 
split into two phases: a disordered phase, called D, and a ^-structure-rich phase, B (see 
Methods for a detailed description). Because the phases are very different, we analyze 
the two subsets of trajectories separately. 

We find that the D phase displays virtually no force resistance. Force peaks > 20 pN 
occur in only 2 of our 449 trajectories started from this phase. In both these runs, 
the initial energy is close to the cutoff used to distinguish the phases. Among our 575 
trajectories started from the B phase, there are, in contrast, only 12 without any force 
peak > 20 pN. The typical B phase conformation exhibits one large P-sheet with several 
strands, often arranged in a simple meander pattern II18II . When stretched, it breaks up 
through a sequence of usually two or three rupture events (Table [l]), at each of which a 
few strands are lost (Fig.[l]). Force peaks occurring later in time tend to be higher than 
earlier ones, at least in part because the strands can be more parallel to the applied force 
as they get fewer in number. In most runs (> 90%), the last force peak is associated 
with the breaking of a three-stranded ^-sheet. 

In accord with the observation of conformational polymorphism in AFM experiments 
QtB, our in total 1464 aS rupture events show a wide variation in both Fj. and L^ (Fig.JTJ). 
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Figure 7: Histograms of rupture force, F^, and rupture distance (inset), L^, for aS, based on a 
total of 1464 rupture events, the vast majority (1461) of which are from trajectories 
started from the B phase. 



The f J. distribution starts to fall off at ~ 300 pN. The maximum Fj. is ~ 740 pN, but there 
are only seven events with F^ > 500 pN. The L^ distribution exhibits a peak betw^een 
35-40 nm that can be associated w^ith three-stranded ^-sheets, whereas structures with 
more strands tend to break at lower L^. 

Because aS is much larger than A|3, with 140 amino acids instead of 42, a mechan- 
ical clamp in aS does not have to involve the whole molecule. To locate the parts of 
aS forming clamps in our simulations, we compute a p-structure profile that represents 
an average over all rupture events caused by three-stranded ^-sheets (Fig.js]). This pro- 
file shows that, although their exact position varies, the force-resistant three-stranded 
P-sheets have a statistically preferred sequence location, which is the ~ 35-65 region. 
In aS fibrils, each molecule is believed to participate in five face-to-face stacked inter- 
molecular P-sheets | |21 || . Experiments suggest that the five strand regions include the 
respective stretches 39-41, 52-56, 62-66, 70-77 and 90-94 [21 J, which are shaded 
gray in Fig. |5} Our calculated ^-structure profile shows clear similarities with the pro- 
posed fibril strand locations. The profile has its three highest peaks in the first three 
fibril strand regions, and shows signs of shoulders in the remaining two of these re- 
gions. As the topology is also the same, a meander pattern, this finding implies that the 
mechanical clamps observed in our simulations show similarities to the fibril fold. In 
particular, our results suggest a key mechanical role for the part of aS which forms the 
first three fibril strands. 

Discussion 

AFM experiments on the unstructured Ap and aS have observed a very broad range 
of rupture forces, including values of up to 300^00 pN OHHtI]. The question arises 
whether these high forces may be due to artifacts such as topological entanglements. 
One important result of our study is therefore the finding that even the short Ap peptide 
can form structures with high mechanical resistance on its own. 

There have been previous simulations of other mechanical aspects of proteins impli- 
cated in neurodegenerative diseases II39I l40ll . We are the first, however, to perform 
calculations that can be directly compared to the AFM pulling experiments on Ap and 
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aS. We can thereby study structural components of the unexpected force resistance 
found in those experiments. 

The force spectra observed in our simulations are comparable to those of the experi- 
ments. Furthermore, the vast majority of our high-force rupture events, for both A(3 and 
aS, involve the same type of mechanical clamp, namely a three-stranded P-sheet with 
meander topology. The A^ trajectories only rarely contain more than one force peak. 
For aS, w^e often observe two or three force peaks, the last and highest of which can 
be associated with a three-stranded |3-sheet. In hindsight, the form found for the main 
mechanical clamps appears plausible, especially for A[5. Alternative folds with the same 
mechanical strength are not easily conceived for a molecule this size. 

Mechanical resistance usually stems from [5-sheet structure. On general grounds, 
one might therefore expect a correlation between the force resistance of a protein and 
its propensity to form amyloid fibrils (which largely consist of ^-sheets). In line with 
this picture, and with experiments []7l|, we find that the aggregation-enhancing E22G 
mutation increases the mechanical stability of A[5. Its higher force resistance can be 
linked to an increased occurrence of conformations (Fig. |4]) that not only have a high 
overall [5-structure content, but also display a turn with similarities to the [5-loop-P motif 
in fibrils ||19[l20[|38ll . Similarly, for aS, the most common positions of ^-strands in the 
strong mechanical clamps (Fig. [s]) agree very well with the first three strand regions 
in the proposed fibril model | |21 | | . The meander-like paths traced out by the A|3 and 
aS backbones in these force-resistant structural units thus show similarities with the 
proposed fibril folds. These findings hint at a direct structural link between the force 
resistance of these proteins and their amyloid propensity. 

In the A|3 AFM experiments, the fraction of trajectories having at least one force peak 
>20 pN was 33% and 62% for WT and E22G, respectively \^. In our simulations, 
the corresponding numbers are 29 ± 1 % and 34 ±4%. A main parameter influencing 
the amount of force-resistant states is the temperature. Comparison with experimental 
NMR data suggests that our simulation temperature corresponds to 0-5 °C ||17l| . The 
AFM study was done at room temperature, but the sample was stored at 4°C between 
experiments (R. Hervas, personal communication, 2013). The ensemble probed in the 
experiments could be influenced by this lower temperature, due to slow thermalization 
of Ap when anchored to a molecular construct. Hence, there are uncertainties about the 
precise relation between the simulated and experimental ensembles. It is remarkable 
that the simulated and experimental fractions of trajectories containing rupture events, 
nevertheless, are within a factor of two of each other for both WT and E22G. 

The aS AFM experiments, also performed at room temperature (R. Hervas and M. 
Sandal, personal communications, 2013), found 7-45% of the trajectories to contain 
rupture events []5l [7|. In our aS simulations, this fraction is < 1% and 98% for the 
D and B phases, respectively. Hence, for a force response matching experimental data, 
a significant B phase fraction must be assumed. As previously shown II18II . a B phase 
fraction of 50-70 % is compatible with NMR data at-10°Cand-15°C[l33l[3l|. How- 
ever, at room temperature, the free aS monomer is disordered [|4TT1 , and NMR data at 
15 °C II32II are well described by the D phase alone [18.1 . 
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How does one then explain the mechanical resistance seen in room temperature AFM 
experiments on aS? Two factors that could affect the AFM ensemble, and push it toward 
the B phase, are as follows. First, as discussed for Ap, due to the kinetically restrictive 
anchoring to a construct, the AFM ensemble might retain some memory of the low 
temperature, 4°C, at which the samples were held between experiments. Second, the 
anchoring to the molecular construct imposes a constraint on the end-to-end distance, 
which might disfavor the D phase; the simulated B and D phases have average end-to- 
end distances of 5.7 nm and 8.1 nm, respectively. We note that Hervas et al. ^, who 
used a stricter construct, recorded a higher fraction of trajectories containing rupture 
events, compared to Sandal et al. [5j. 

The heterogeneous mechanical response revealed by AFM studies QsHZll is not the 
only indication that ^-sheet-containing states are readily accessible to the Ap and aS 
monomers. A finding supporting this view is the NMR-derived structure of A|3 in com- 
plex with an Affibody protein 114211 . ^-sheets formed by the monomers are, of course, 
different than the intermolecular ones found in fibrils, but transient sampling of such 
structures might, nevertheless, be important in fibril formation. The addition of a ran- 
dom coil monomer to a growing fibril entails a high cost in conformational entropy, un- 
less the chain is very short. This cost is in part balanced by intermolecular interactions, 
but a free-energy landscape where intramolecular interactions funnel the monomer to- 
ward the fibril fold might be crucial for efficient monomer-fibril association. 

Simulation studies of A(3 have been reported by many groups [for recent examples see 
references 1431 - 15011 . which typically observed only very limited amounts of |3-structure. 
We, however, got the best agreement with experimental chemical shifts [30] and J- 
couplings II31II for an ensemble with a significant fraction of ^-sheet-rich states II17II . 
In the pulling simulations presented here, the same [3-sheet-rich states can explain the 
high rupture forces observed in AFM experiments (Fig. [3]). We therefore take the com- 
parisons with AFM data as an independent indication that A^ need not be as coil-like 
as commonly thought. 

Conclusion 

We have examined the structural mechanisms providing the surprisingly high mechani- 
cal stability of the unstructured proteins Ap and aS observed in AFM experiments QHHZl) 
using a large number of pulling simulations with initial conformations randomly drawn 
from simulated ensembles. Our main findings are as follows, (i) For both A(3 and aS, 
structures with high force resistance do indeed occur in the simulations, and the main 
type of mechanical clamp is a P-sheet with three strands of sufficient length, arranged 
in a meander pattern, (ii) The Arctic mutation of Ap leads to increased occurrence of 
highly force-resistant structures, (iii) The mechanically stable structures of both Ap and 
aS show similarities with the respective fibril folds. These findings suggest that P-sheet- 
rich force-resistant structures are readily accessible to Ap and aS and might have a key 
role in amyloid formation. 
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